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Abstract 

This paper initiates the study of hidden variables from the discrete, abstract perspective of 
quantum computing. For us, a hidden-variable theory is simply a way to convert a unitary 
matrix that maps one quantum state to another, into a stochastic matrix that maps the initial 
probability distribution to the final one in some fixed basis. We list seven axioms that we 
might want such a theory to satisfy, and then investigate which of the axioms can be satisfied 
simultaneously. Toward this end, we construct a new hidden- variable theory that is both robust 
to small perturbations and indifferent to the identity operation, by exploiting an unexpected 
connection between unitary matrices and network flows. We also analyze previous hidden- 
variable theories of Dieks and Schrodinger in terms of our axioms. In a companion paper, we 
will show that actually sampling the history of a hidden variable under reasonable axioms is 
at least as hard as solving the Graph Isomorphism problem; and indeed is probably intractable 
even for quantum computers. 

1 Introduction 

Quantum mechanics lets us calculate the probability that (say) an electron will be found in an 
excited state if measured at a particular time. But it is silent about multiple-time or transition 
probabilities: that is, what is the probability that the electron will be in an excited state at time ti, 
given that it was in its ground state at an earlier time to? The usual response is that this question 
is meaningless, unless of course the electron was measured (or otherwise known with probability 
1) to be in its ground state at io- A different response — pursued by Schrodinger |22], Bohm 0, 
Bell [2, Nelson ^5], Dieks ^0], and others — treats the question as provisionally meaningful, and 
then investigates how one might answer it mathematically. Specific attempts at answers are called 
"hidden-variable theories." 

The appeal of hidden-variable theories is that they provide one possible solution to the mea- 
surement problem. For they allow us to apply unitary quantum mechanics to the entire universe 
(including ourselves), yet still discuss the probability of a future observation conditioned on our 
current observations. Furthermore, they let us do so without making any assumptions about de- 
coherence or the nature of observers. For example, even if an observer were placed in coherent 
superposition, that observer would still have a sequence of definite experiences, and the probability 
of any such sequence could be calculated. 

This paper initiates the study of hidden variables from a quantum computing perspective. We 
restrict our attention to the simplest possible setting: that of discrete time, a finite-dimensional 
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Hilbert space, and a fixed orthogonal basis. Within this setting, we reformulate known hidden- 
variable theories due to Dieks jlOj and Schrodinger j22j . and also introduce a new theory based 
on network flows. However, our main contribution is the axiomatic approach that we use. We 
propose seven axioms for hidden- variable theories in our setting, and then compare theories against 
each other based on which of the axioms they satisfy. A central question in this approach is which 
subsets of axioms can be satisfied simultaneously. 

In a companion paper p], we will make the connection to quantum computing explicit, by prov- 
ing that under any hidden- variable theory that satisfies three reasonable axioms (called symmetry, 
indifference, and robustness), the ability to examine one's entire "history" through a quantum sys- 
tem would entail the ability to solve the Graph Isomorphism problem in polynomial time. What 
makes this result surprising is that, in the so-called oracle or black-box model, sampling histories 
would not entail the ability to solve NP-complete problems in polynomial time. We thus obtain 
the first good example of a computational model that appears "slightly" more powerful than the 
usual quantum computing model. 

This paper lays the groundwork for the computational results of (Ij, in particular by showing 
that there exists a hidden-variable theory satisfying the symmetry, indifference, and robustness 
axioms. 

The paper is organized as follows. Section fl.ll formallv defines hidden- variable theories in our 
sense; then Section [1.21 contrasts these theories with related ideas such as Bohmian mechanics and 
modal interpretations. Section 11.31 addresses the most common objections to our approach: for 
example, that the implicit dependence on a fixed basis is unacceptable. 

In Section [2 we introduce seven possible axioms for hidden-variable theories. These are 
symmetry under permutation of basis states; indifference to the identity operation; robustness 
to small perturbations; "block robustness," a weaker version of robustness; commutativity with 
respect to spacelike-separated unitaries; commutativity for the special case of product states; and 
invariance under decomposition of mixed states into pure states. Ideally, a theory would satisfy 
all of these axioms. However, we show in Section |31 that no theory satisfies both indifference and 
commutativity; no theory satisfies both indifference and a stronger version of robustness; no theory 
satisfies indifference, robustness, and decomposition invariance; and no theory satisfies a stronger 
version of decomposition invariance. The proofs of two of these results use the same geometric 
facts that underlie the Bell inequalities. 

In Section |1] we shift from negative to positive results. Section [4.11 presents a hidden- variable 
theory called the flow theory or J^T, which is based on the Max-Flow-Min-Cut theorem from 
combinatorial optimization. The idea is to define a network of "pipes" from basis states at an 
initial time to basis states at a final time, and then route as much probability mass as possible 
through these pipes. The capacity of each pipe depends on the corresponding entry of the unitary 
acting from the initial to final time. To find the probability of transitioning from basis state \i) to 
basis state \ j), we then determine how much of the flow originating at \i) is routed along the pipe 
to Our main results are that J-T is well-defined and that it is robust to small perturbations. 
Since J^T trivially satisfies the indifference axiom, this implies that the indifference and robustness 
axioms can be satisfied simultaneously, which was not at all obvious a priori. The flow theory also 
satisfies symmetry, but not product commutativity, scalar invariance, or decomposition invariance. 

Section 14.21 presents a second theory that we call the Schrodinger theory or ST, since it is 
based on a pair of integral equations introduced in a 1931 paper of Schrodinger [221 • Schrodinger 
conjectured, but was unable to prove, the existence and uniqueness of a solution to these equations; 
the problem was not settled until the work of Nagasawa JSI in the 1980's. In our discrete setting 
the problem is simpler, and we give a self-contained proof of existence using a matrix scaling 
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technique due to Sinkhorn |2S]- The idea is as follows: we want to convert a unitary matrix that 
maps one quantum state to another, into a nonnegative matrix whose i^^ column sums to the initial 
probability of basis state and whose j^^ row sums to the final probability of basis state To 
do so, we first replace each entry of the unitary matrix by its absolute value, then normalize each 
column to sum to the desired initial probability, then normalize each row to sum to the desired 
final probability. But then the columns are no longer normalized correctly, so we normalize them 
again, then normalize the rows again, and so on. We show that this iterative process converges, 
from which it follows that ST is well-defined. We also show that ST satisfies the symmetry, 
indifference, and product commutativity axioms; and violates the decomposition invariance axiom. 
We conjecture that ST satisfies the robustness axiom; proving that conjecture is the main open 
problem of the paper. We conclude in Sectional 



1.1 Hidden- Variable Theories 

Suppose we have an N x N unitary matrix U, acting on a state 

= ai|l) + --- + ajv|A^), 
where |1) , . . . , |A^) is a standard orthogonal basis. Let 

U\iP)=Pi\1) + --- + (3n\N). 
Then can we construct a stochastic matrix S, which maps the vector of probabilities 



P 



induced by measuring to the vector 



|Q!l| 



\aN\ 



induced by measuring U \ 



Trivially yes. The following matrix maps any vector of probabilities 



to q , ignoring the input vector p entirely: 



1/3. 



1/3. 



Here VT stands for product theory. The product theory corresponds to a strange picture of physical 
reality, in which memories and records are completely unreliable, there being no causal connection 
between states of affairs at earlier and later times. 

So we would like S to depend on U itself somehow, not just on |^) and U \ ip). Indeed, ideally 
S would be a function only of U, and not of But this is impossible, as the following example 
shows. Let U he a 7r/4 rotation, and let |+) = (|0) + |1)) /\/2 and |-) = (|0) - |1)) /\/2. Then 
U \+) = |1) implies that 

s{\+),u)=\l J 
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whereas U\—) = |0) implies that 

On the other hand, it is easy to see that, if S can depend on 1-0) as well as C/, then there are 
infinitely many choices for the function S ,U). Every choice reproduces the predictions of 
quantum mechanics perfectly when restricted to single-time probabilities. So how can we possibly 
choose among them? Our approach in Sections [21 and will be to write down axioms that we 
would like S to satisfy, and then investigate which of the axioms can be satisfied simultaneously. 

Formally, a hidden-variable theory is a family of functions {»S'Ar}^>-|^, where each maps an 
A^-dimensional mixed state p and an x unitary matrix U onto a singly stochastic matrix 
Sn {p, U). We will often suppress the dependence on N , and use subscripts such as VT or J^T to 
indicate the theory in question. Also, if p = (V'l is a pure state we may write S ^U) instead 

Let {M)-j denote the entry in the i*'' column and j*^ row of matrix M. Then (5')^^ is the 
probability that the hidden variable takes value \j) after U is applied, conditioned on it taking 
value \i) before U is applied. At a minimum, any theory must satisfy the following marginalization 
axiom: for all j G {1, . . . , A'^}, 

i 

This says that after U is applied, the hidden variable takes value |j) with probability {U pU~^^ 
which is the usual Born probability. 

Often it will be convenient to refer, not to S itself, but to the matrix P {p, U) of joint probabilities 
whose {i,j) entry is {P)ij = The i^^ column of P must sum to and the j*'' row 

must sum to {U pU~^^ ... Indeed, we will define the theories TT and ST by first specifying the 
matrix P, and then setting {S)^j := {P)ij / {p)ii- This approach has the drawback that if [p)n = 0, 
then the column of S is undefined. To get around this, we adopt the convention that 

S{p,U) := \\m S{p,,U) 

£— >0+ 

where = (1 — e) /9 + e/ and I is the N x N maximally mixed state. Technically, the limits 



lim 



£^0+ {ps)ii 

might not exist, but in the cases of interest to us it will be obvious that they do. 



1.2 Comparison with Previous Work 

Before going further, we should contrast our approach with previous approaches to hidden variables, 
the most famous of which is Bohmian mechanics 5 . Our main criticism of Bohmian mechanics 
is that it commits itself to a Hilbert space of particle positions and momenta. Furthermore, it is 
crucial that the positions and momenta be continuous, in order for particles to evolve determin- 
istically. To see this, let \L) and \R) be discrete positions, and suppose a particle is in state \L) 
at time ti, and state (|-L) + |-R)) /\/2 at a later time t2- Then a hidden variable representing the 
position would have entropy at ti, since it is always \L) then; but entropy 1 at t2, since it is 
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\L) or \R) both with 1/2 probabihty. Therefore the earher value cannot determine the later one.^ 
It follows that Bohmian mechanics is incompatible with the belief that all physical observables 
are discrete. But in our view, there are strong reasons to hold that belief, which include black 
hole entropy bounds; the existence of a natural minimum length scale (10~^^ cm); results on area 
quantization in quantum gravity 21 ; the fact that many physical quantities once thought to be 
continuous have turned out to be discrete; the infinities of quantum field theory; the implausibility 
of analog "hypercomputers" ; and conceptual problems raised by the independence of the continuum 
hypothesis. 

Of course there exist stochastic analogues of Bohmian mechanics, among them Nelsonian me- 
chanics and Bohm and Hiley's "stochastic interpretation" [2]. But it is not obvious why we 
should prefer these to other stochastic hidden- variable theories. Prom a quantum-information 
perspective, it is much more natural to take an abstract approach — one that allows arbitrary finite- 
dimensional Hilbert spaces, and that does not rule out any transition rule a priori. 

Stochastic hidden variables have also been considered in the context of modal interpretations; 
see Dickson (Hj, Bacciagaluppi and Dickson f2l, and Dieks [IDj for example. However, the central 
assumptions in that work are extremely different from ours. In modal interpretations, a pure state 
evolving unitarily poses no problems at all: one simply rotates the hidden-variable basis along with 
the state, so that the state always represents a "possessed property" of the system in the current 
basis. Difficulties arise only for mixed states; and there, the goal is to track a whole set of possessed 
properties. By contrast, our approach is to fix an orthogonal basis, then track a single hidden 
variable that is an element of that basis. The issues raised by pure states and mixed states are 
essentially the same. 

Finally we should mention the consistent-histories interpretation of Griffiths ^1 and Gell-Mann 
and Hartle jl4j . This interpretation assigns probabilities to various histories through a quantum 
system, so long as the "interference" between those histories is negligible. Loosely speaking, then, 
the situations where consistent histories make sense are precisely the ones where the question of 
transition probabilities can be avoided. 

I. 3 Objections 

Hidden-variable theories, as we define them, are open to several technical objections. For exam- 
ple, we required transition probabilities for only one orthogonal observable. What about other 
observables? The problem is that, according to the Kochen-Specker theorem, we cannot assign 
consistent values to all observables at any single time, let alone give transition probabilities for 
those values. This is an issue in any setting, not just ours. The solution we prefer is to postulate 
a fixed orthogonal basis of "distinguishable experiences," and to interpret a measurement in any 
other basis as a unitary followed by a measurement in the fixed basis. As mentioned in Section 

II. 2( modal interpretations opt for a different solution, which involves sets of bases that change over 
time with the state itself. It might be interesting to combine the approaches. 

Another objection is that the probability of transitioning from basis state \i) at time ti to basis 
state \j) at time t2 might depend on how finely we divide the time interval between ti and t2- In 
other words, for some state It/') and unitaries V, W, we might have 

sm,w v) ^S{V 1^) , W) S (IV) , V) 

^Put differently, Bohm's conservation of probability result breaks down because the "wavefunctions" at ti and 
t2 are degenerate, with all amplitude concentrated on finitely many points. But in a discrete Hilbert space, every 
wavefunction is degenerate in this sense! 
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(a similar point was made by Gillespie Indeed, this is true for any hidden- variable theory 

other than the product theory VT . To see this, observe that for all unitaries U and states there 
exist unitaries V, W such that U = WV and V = Then applying V destroys all information 
in the hidden variable (that is, decreases its entropy to 0); so if we then apply W , then the variable's 
final value must be uncorrelated with the initial value. In other words, S {V IV') , W) S , V) 
must equal S-pq- dV') It follows that to any hidden- variable theory we must associate a time 

scale, or some other rule for deciding when the transitions take place. 

In our defense, let us point out that exactly the same problem arises in continuous-time 
stochastic hidden- variable theories. For if a state is governed by the Schrodinger equation 
dlijj) /dt = iHt IV'), and a hidden variable's probability distribution ~p is governed by the stochastic 
equation d'p /dr = Ar~p , then there is still an arbitrary parameter dr/dt on which the dynamics 
depend. 

Finally, it will be objected that we have ignored special relativity. In Section [21 we will define 
a commutativity axiom, which informally requires that the stochastic matrix S not depend on the 
temporal order of spacelike separated events. Unfortunately, we will see that when entangled states 
are involved, commutativity is irreconcilable with another axiom that seems even more basic. The 
resulting nonlocality has the same character as the nonlocality of Bohmian mechanics — that is, one 
cannot use it to send superluminal signals in the usual sense, but it is unsettling nonetheless. 



2 Axioms for Hidden- Variable Theories 

We now state seven axioms that we would like hidden-variable theories to satisfy. 

Symmetry. A theory is symmetric if it is invariant under relabeling of basis states: that is, 
if for all p, U and all permutation matrices Q, 

Q'^S {p, U)Q = S {Q-^pQ, Q-^UQ) . 

All theories discussed in this paper are symmetric. 

Indifference. Suppose we partition the basis states into 'blocks,' between which U can never 
produce interference. Call an ordered pair (/, J) of subsets of {1, . . . , A^} a block if (C^)jj = for 
all i G / and j ^ J, as well as alH ^ / and j £ J. Also, call (/, J) a minimal block if |I| = \J\ and 
no (/*, J*) with I* C I and J* C J is a block. Note that if (/i, Ji) , . . . , {Im, Jm) are the minimal 
blocks, then both {/i, . . . , 1^} and { Ji, . . . , Jm} are partitions of {1, ... , A^}. We say a theory is 
indifferent if it never produces interference between minimal blocks — that is, if {S)^j = for all i, j 
in different minimal blocks. In particular, indifference implies that given any state p in a tensor 
product space TLa ^'Hb, and any unitary U that acts only on TLa (that is, is the identity ouTLb), 
the stochastic matrix S (p, U) acts only on TIa as well. 

Robustness. A theory is robust if it is insensitive to small errors in a state or unitary (which, 
in particular, implies continuity). Suppose we obtain p and U by making small changes to p and 
U respectively. Then for all polynomials p, there should exist a polynomial q such that for all A^, 



P[p,U]-P{p, U) 



< 



1 



p{N) 



where 11 Ml 



(M) 



whenever — /9|| < l/q{N) and U — U 
has an important advantage for quantum computing: if a hidden-variable theory is robust then 
the set of gates used to define the unitaries Ui, . . . , Ut is irrelevant, since by the Solovay-Kitaev 



< l/q{N). Robustness 



6 



Theorem (see any universal quantum gate set can simulate any other to a precision e with 

0(log'^l/e) overhead. 

Block Robustness. Unfortunately, one of the theories that we wish to study does not satisfy 
robustness. We therefore define a weaker notion of robustness that this theory satisfies. We say 
a hidden-variable theory is block robust if robustness holds for all modifications p,U p,U such 
that U and U have the same set (/i, Ji) , . . . , {Im, Jm) of minimal blocks. 

Commutativity. Let pab be a bipartite state, and let Ua and U b act only on subsystems A 
and B respectively. Then commutativity means that the order in which Ua and Ub are applied is 
irrelevant: 

S {UapabU^^, Ub) S {pab, Ua) = S {UbpabUj^^, Ua) S {pab, Ub) ■ 

Product Commutativity. A theory is product commutative if it satisfies commutativity for 
all separable pure states \ip) = \iPa) 'iS> \iPb)- 

Decomposition Invariance. A theory is decomposition invariant if 

N 

S{p,U) = Y,p,S{\i;i) {^l^,\,U) 

for every decomposition 

N 

j=i 

of p into pure states. Theorem |2l part (ii) will show that the analogous axiom for P {p, U) is 
unsatisfiable. 



2.1 Comparing Hidden- Variable Theories 

To fix ideas, let us compare some hidden-variable theories with respect to the above axioms. We 
have already seen the product theory VT in Section 11.11 It is easy to show that VT satisfies 
symmetry, robustness, commutativity, and decomposition invariance. However, we consider VT 
unsatisfactory because it violates indifference: even if a unitary U acts only on the first of two 
qubits, S-pT {p, U) will readily produce transitions between (say) |00) and |01), or between |01) and 
1 10). 

Recognizing this problem, Dieks 10; proposed an alternative theory that in our terminology 
corresponds to the following.^ First find the minimal blocks (/i, Ji) , . . . , {Im, Jm) of U . Then 
apply the product theory separately to each minimal block; that is, if i and j belong to the same 
block (/, J) then set 



T.i,AUpU-'y, 



and otherwise set {S)ij = 0. The resulting Dieks theory, T>T , clearly satisfies indifference. However, 
it does not satisfy robustness (or even continuity), since the set of minimal blocks can change if we 
replace '0' entries in U by arbitrarily small nonzero entries. 

In Section |1] we will introduce two other hidden- variable theories, the flow theory and the 
Schrodinger theory ST. The following table lists which axioms the four theories satisfy. 

^Dieks (personal communication) says he would no longer defend this theory. 
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V7 (Product) 


V7 (Dieks) 


J=7 (Flow) 


5r 


Symmetry 


Yes 


Yes 


Yes 


Yes 


Indifference 


No 


Yes 


Yes 


Yes 


Robustness 


Yes 


No 


Yes 


? 


Block Robustness 


Yes 


Yes 


Yes 


? 


Commutativity 


Yes 


No 


No 


No 


Product Commutativity 


Yes 


Yes 


No 


Yes 


Decomposition Invariance 


Yes 


Yes 


No 


No 



(Schrodinger) 



If we could prove that 5T satisfies robustness, then the above table together with the impos- 
sibility results of Section 13 would completely characterize which of the axioms can be satisfied 
simultaneously. 

3 Impossibility Results 

This section shows that certain sets of axioms cannot be satisfied by any hidden-variable theory. 
We first show that the failure of PT, TT ^ and ST to satisfy commutativity is inherent, and not a 
fixable technical problem. 

Theorem 1 No hidden-variable theory satisfies both indifference and commutativity. 

Proof. Assume indifference holds, and let our initial state be = (|00) + |11)) /\/2. Suppose 
Ua applies a vr/S rotation to the first qubit, and Ub applies a — vr/S rotation to the second qubit. 
Then 

Ua IV') = Ub IV') = ^ (cos ^ |00) - sin ^ |01) + sin ^ |10) + cos ^ , 

UaUb IV') = UbUa \^) = \ (|00) - |01) + |10) + 111)) . 

Let ft be the value of the hidden variable after t unitaries have been applied. Let E be the event 
that vq = 1 00) initially, and V2 = |10) at the end. If Ua is applied before Ub, then the unique 
'path' from vq to V2 consistent with indifference sets vi = |10). So 

Ft:[E] < Pr[vi = |10)] = - sin^ -. 

2 8 

But if f/^ is applied before Ua, then the probability that vq = |11) and V2 = |10) is at most ^ sin^ ^, 
by the same reasoning. Thus, since V2 must equal |10) with probability 1/4, and since the only 
possibilities for vq are |00) and |11), 

„r„-, 1 1 971" 1 nvr 

Pr LE > sin^ - > - sin^ -. 

^^- 4 2 8 2 8 

We conclude that commutativity is violated. ■ 

Let us remark on the relationship between Theorem ^ and Bell's Theorem. Any hidden- 
variable theory that is "local" in Bell's sense would immediately satisfy both indifference and 
commutativity. However, the converse is not obvious, since there might be nonlocal information 
in the states Ua \tp) or Ub \'4'), which an indifferent commutative theory could exploit but a local 
one could not. Theorem ^ rules out this possibility, and in that sense is a strengthening of Bell's 
Theorem. 

The next result places limits on decomposition invariance. 
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Theorem 2 

(i) No theory satisfies indifference, robustness, and decomposition invariance. 

(ii) No theory has the property that 

N 



i=l 



for every decomposition YliLiPi iV'j) (V'il of p. 
Proof. 

(i) Suppose the contrary. Let 



Re 



cos t) — sm ( 
sin 6 cos 6 



\<fe) = cos6'|0) + sin 6*11) 
Then for every 6 not a multiple of 7r/2, we must have 



S {\^7T/2-e) ,Re) 



1 1 





1 1 



So by decomposition invariance, letting / = (|0) (0| + |1) (1|) /2 denote the maximally mixed 
state, 



S (/, Rg) — S 



, Rff 





- 1 


1 - 




2 


2 


)- 


1 


1 




. 2 


2 . 



and therefore 



PiI,Re 



(p)oo 

2 2 
(p)oo 

2 2 





■ 1 


1 - 




4 


! 




1 




. 4 


4 . 



By robustness, this holds for 9 = as well. But this is a contradiction, since by indifference 
P (/, i?o) must be half the identity. 

(ii) Reminiscent of Theorem^ Suppose the contrary; then 

P (I, R^/s) = 
So considering transitions from |0) to 

_ (P(|0),i?,/8)),,+0 _ 1 . 



But 



[P {l,Rn/8)) 



P{i,Rn/s) 



01 



— sm — . 

2 8 



^(|v'7r/8),^7r/8) + P {\V5n /s) , Rn /s) 
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also. Since iJ^/g \'f-K/s) = 19577/4)5 we have 



P(/,i?./8))oi>Tj(^(kv8>,«V8)) 



01 



which is a contradiction. 



1/1 . vr 
> - - - sin^ - 
- 2 I 2 8 



1 . 2 

>-sm 



vr 



Notice that aU three conditions in Theorem|21 part (i) were essential — for VT satisfies robustness 
and decomposition invariance, T>T satisfies indifference and decomposition invariance, and J-T 
satisfies indifference and robustness. 

Our last impossibility result says that no hidden-variable theory satisfies both indifference and 
"strong continuity," in the sense that for all e > there exists 5 > such that ||p — p|| < 5 implies 
||5 {p, U)-S {p, U)\\ < e. To see this, let 



U 



1 







^ 

^ ^ 



p=Vl-26^\0) + 6\l) + 6\2), 
p = \/l-252|0)+<5|l)-(5|2). 



Then by indifference. 



Sip,U) 



1 


oil 



Sip,U) 



1 
1 1 




This is the reason why we defined robustness in terms of the joint probabilities matrix P rather 
than the stochastic matrix S. On the other hand, note that by giving up indifference, we can 
satisfy strong continuity, as is shown by J^T. 



4 Specific Theories 

This section presents the main results of the paper, which concern two nontrivial examples of 
hidden- variable theories: the flow theory in Section f4. 11 and the Schrodinger theory in Section f4. 21 



4.1 Flow Theory 

The idea of the flow theory is to convert a unitary matrix into a weighted directed graph, and then 
route probability mass through that graph like oil through pipes. Given a unitary U, let 



" /3i " 






















_ UN _ 
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Figure 1: A network (weighted directed graph with source and sink) corresponding to the unitary 

U and state |^) 



where for the time being 



IV) = ai|l) + --- + a^|A^), 
U\i;)=Pi\l) + --- + PN\N) 

are pure states. Then consider the network G shown in Figure ^ We have a source vertex s, a 
sink vertex t, and input and N output vertices labeled by basis states |1) , . . . , |A^). Each edge 
of the form (s, has capacity |aj|^, each edge , |j)) has capacity {U)-j , and each edge , t) 

has capacity A natural question is how much probability mass can flow from s to t without 

violating the capacity constraints. Rather surprisingly, we show that one unit of mass (that is, 

2 

,|j)) had capacity (U)-- (or 



all of it) can. Interestingly, this result would be false if edge 

l+e 



even 



instead of 



{U) 



We also show that there exists a mapping from networks to 
maximum flows in those networks, that is robust in the sense that a small change in edge capacities 
produces only a small change in the amount of flow through any edge. 

The proofs of these theorems use classical results from the theory of network flows (see [7| for an 
introduction). In particular, let a cut be a set of edges that separates s from t; the value of a cut 
is the sum of the capacities of its edges. Then a fundamental result called the Max- Flow- Min- Cut 
Theorem 11; says that the maximum possible amount of flow from s to t equals the minimum value 
of any cut. 

Theorem 3 One unit of flow can be routed from s to t in G. 

Proof. By the above, it suffices to show that any cut C in G has value at least 1. Let A be 
the set of z G {1, . . . , A^} such that (s, \i)) ^ C, and let B be the set of j such that ,t) ^ C. 
Then C must contain every edge {\i) , \j)) such that i £ A and j G B, and we can assume without 
loss of generality that C contains no other edges. So the value of C is 



i^A jt^B i&A, jeB 

Therefore we need to prove the matrix inequality 



(U) 



1 



1 




> 1, 
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or 



1+ E |(f^)..|>Ei«^i' + Ei/^^-i'- 

Let U be fixed, and consider the maximum of tlie riglit-hand side over all Since 

/5. = E(^)u-"- 

i 

this maximum is equal to the largest eigenvalue A of the positive semidefinite matrix 

Ei^) (^i + Ei^j)^^ji 

where for each j, 

\u,) = {U\^\l) + --- + {U)^^\N). 

Let Ha be the subspace of states spanned by : i G ^4}, and let Hb be the subspace spanned by 
{\uj) : j G B}. Also, let La be the length of the projection of onto Ha, and let Lb {\ip)) 
be the length of the projection of {ip) onto Hb- Then since the \i)'s and \uj)^s form orthogonal 
bases for Ha and Hb respectively, we have 



max(L^(|V»'+LB(|V»' 



So letting 9 be the angle between Ha and Hb, 



A = 2 cos^ - 

= 1 + cos 6 
< 1 + 



max l(a|6)| 



= 1 + max 

1 71 

|5i|'+-+|5ivP=l 

<i+ E |(^) 



|7i|'+-+|7iVp=l 




which completes the theorem. ■ 

Observe that Theorem 01 still holds if U acts on a mixed state p, since we can write p as a 
convex combination of pure states \^p) {^p\, construct a flow for each \il>) separately, and then take 
a convex combination of the flows. 

Using Theorem |21 we now define the flow theory J^T. Let F (p, U) be the set of maximum 



flows for p, U — representable by x arrays of real numbers fij such that < fij < 
all and also 



for 



12 



Clearly F {p, U) is a convex polytope, which Theorem asserts is nonempty. Form a maximum 
flow /* (/O, U) G F {p, U) as follows: first let fl^ be the maximum of /n over all / G F (p, U). Then 
let be the maximum of /12 over all / G F (p, [/) such that /n = /f;^. Continue to loop through 
all i,j pairs in lexicographic order, setting each f*- to its maximum possible value consistent with 
the {i — 1) N + j — 1 previous values. 

We define the joint probabilities matrix P by "symmetrizing" /* (p, U) over permutations of 
basis states — that is, 

Q 

where Q ranges over all x permutation matrices. As discussed in Section Fl. 11 given P we can 
easily obtain the stochastic matrix S by dividing the i''^ column by or taking a limit in case 
= 0. It is easy to check that J-T so defined satisfies the symmetry and indifference axioms. 
Showing that J-T satisfies robustness is harder. Our proof is based on the Ford-Fulkerson 
algorithm , a classic algorithm for computing maximum fiows that works by finding a sequence 
of "augmenting paths," each of which increases the fiow from s to t by some positive amount. 

Theorem 4 J^T satisfies robustness. 

Proof. Let G be an arbitrary flow network with source s, sink t, and directed edges ei, . . . , e^, 
where each has capacity Cj and leads from Vi to Wi . It will be convenient to introduce a flctitious 
edge eo from t to s with unlimited capacity; then maximizing the flow through G is equivalent to 
maximizing the flow through cq. Suppose we produce a new network G by increasing a single 
capacity Cj* by some e > 0. Let /* be the optimal flow for G, obtained by first maximizing the 
flow J'o through cq, then maximizing the flow /i through ei holding /q flxed, and so on up to fm- 
Let /* be the maximal flow for G produced in the same way. We claim that for all i G {0, . . . , m}, 



i Ji 



< e. 



To see that the theorem follows from this claim: flrst, if /* is robust under adding e to Cj*, then 
it must also be robust under subtracting e from Cj* . Second, if we change p, U to p,U such that 



\\p- p\\ <l/q{N) and U-U 
changed one by one, so that 



< 1/q (N), then we can imagine the N'^ + 2N edge capacities are 



r (p,u)-r iP,u)\\<Y^ 



u 



UpU-')^^-{UpU-')^ 



33 



< 



4iV2 

W)' 



(Here we have made no attempt to optimize the bound.) Third, symmetrizing over all row and 
column permutations can only decrease P — P 



not increase it. 



We now prove to the claim. To do so we describe an iterative algorithm for computing /*. First 
maximize the fiow /o through cq, by using the Ford-Fulkerson algorithm to find a maximum fiow 
from s to t. Let /(''^ be the resulting flow, and let G^^^ be the residual network that corresponds 



to f^^\ For each i, that is, G^^^ has an edge = {vi,Wi) of capacity c[^^ = Ci — ff^\ and an edge 



f(0) 
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= {wi,Vi) of capacity c^^'^ = f-'^'. Next maximize /i subject to /o by using the Ford-Pulkerson 
algorithm to find "augmenting cycles" from wi to vi and back to wi in G^^^ \ {eo,eo}. Continue 
in this manner until each of /i, . . . , /m has been maximized subject to the previous /j's. Finally 
set /* = 

Now, one way to compute /* is to start with /*, then repeatedly "correct" it by applying the 



ciO) 



same iterative algorithm to maximize /o, then /i, and so on. Let £i 



n-n 



; then we need 

to show that Ei < e for all i E {0, . . . ,'m,}. The proof is by induction on i. Clearly eq < e, 
since increasing a* by £ can increase the value of the minimum cut from s to t by at most e. 
Likewise, after we maximize /o, the value of the minimum cut from wi to vi can increase by at 
most E — Eo + Eq = E. For of the at most e new units of flow from wi to vi that increasing Cj* made 
available, Eq of them were "taken up" in maximizing /o, but the process of maximizing /o could 
have again increased the minimum cut from wi to vi by up to eo- Continuing in this way, 

E2 < E - Eq + Eq - El + El = E, 

and so on up to Em- This completes the proof. ■ 

That J^T violates decomposition invariance now follows from Theorem |2l part (i). However, it 
might be helpful to see an explicit counterexample. Let / be the 1-qubit maximally mixed state, 
and let Rj^/^ be a 7r/4 rotation. Then R^/4^IR~^^ = I, and 



1 
1 



since the 1/2 unit of flow from |0) all gets routed to |0), and then the 1/2 unit of flow from |1) can 
only be routed to |1). On the other hand, let \(pe) = cos 9\0) + sin 9\1); then S ([(/^^/g) , Rt^/^) and 
5 (Ivsvr/s) ^^tt/a) clearly do not equal the identity, since cos^ (tt/S) units of flow cannot be routed 
along an edge of capacity only l/\/2- Therefore 



S{l,R^/4) + 



S (Iv'tt/s) ,Rn/4) + S (I'/^Stt/s) ,-^77/4) 



Let us also show that J^T violates product commutativity. Let 



be a 



2-qubit initial state, and let and R^/^ be 7r/4 rotations applied to the first and second qubits 



respectively. Suppose is applied first to change the second qubit from |(/?-7r/8/ to l^Pn/s}- 

Then one can check that ^ cos^ ^ probability mass is routed from |0) to |0), and ^ sin^ ^ from |1) to 
|1); the edge capacities never come into play. So S (^\ip) , R^/^j is the identity, which implies 

that 



SIR. 



'n/4 



' ^7r/4 



,R^/4)=s{\ip^/,)<S,\^^/s),R^/4 





10 10 
10 1 



On the other hand, if is applied first to change the state to \l) j'/'.jr/g), then when R^^^ is 

applied next, at most l/\/2 of the cos^ ^ probability mass at |10) can be routed to |10); the rest 
must go to 1 11). It follows that 



S ( 



B 

■7r/4 



s 



' ^7r/4 



^S{R^M,R^,AS 



r/4 



■7r/4 



,R 



■7r/4 
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4.2 Schrodinger Theory 

Our final hidden- variable theory, which we call the Schrodinger theory or ST, is the most interesting 
one mathematically. The idea — to make a matrix into a stochastic matrix via an iterative process 
of row and column rescaling — is natural enough that we came upon it independently, only later 
learning that it originated in a 1931 paper of Schrodinger Schrodinger gave a pair of functional 
integral equations that such an iterative process would solve, but was unable to prove that those 
equations always have a solution. The existence and uniqueness of a solution were shown under 
broad conditions by Nagasawa TH" , building on earlier work of Fortet [T^ and Beurling W . Our 
goal is to give what (to our knowledge) is the first self-contained, reasonably accessible presentation 
of the main result in this area; and to interpret that result in what we think is the correct way: 
as providing one example of a hidden-variable theory, whose strengths and weaknesses should be 
directly compared to those of other theories. 

Most of the technical difficulties in |^ ^1 ^1 arise because the stochastic process being 
constructed involves continuous time and particle positions. Here we eliminate those difficulties by 
restricting attention to discrete time and to finite-dimensional Hilbert spaces. We thereby obtain 
a generalized version^ of a problem that computer scientists know as {r,c) -scaling of matrices. 
Sinkhorn [23J gave an algorithm for the (r, c)-scaling problem, which was shown to run in polynomial 
time by Franklin and Lorenz (see also Linial, Samorodnitsky, and Wigderson jl£j). 

As in the case of the flow theory, given a unitary U acting on a state p, the first step is to 
replace each entry of U by its absolute value, obtaining the nonnegative matrix C/(o) defined by 
[U^^^).^ := iU)^j . We then repeatedly tweak U^^^ to bring it closer to a joint probabilities matrix 

P{p,U). We want to make the i*^ column of the matrix sum to {p)^, and the j^^ row sum to 
{U pU^"^^ .. for all z,j G {l,--- iN}. The stochastic matrix S {p,U) is then readily obtained by 
normalizing each column to sum to 1. 

The algorithm is iterative. For each t > we obtain by normalizing each column i of 

C/^^*^ to sum to (/j)jj; likewise we obtain C/^^*"''^-* by normalizing each row j of J7(^*+-^) to sum to 
{U pU^"^^ ... More formally, 



f/(2^+l)^ ^ ip)ii I jj{2t) 



The crucial fact is that the above iteration converges. Our proof will reuse a result about network 
flows from Section [4.11 in order to define a nondecreasing "progress measure" based on Kullback- 
Leibler distance. 

Theorem 5 The limit U^°°^ = limj^oo U^^^ exists. 

Proof. A consequence of Theorem El is that for every p, U, there exists an N x N array of 
nonnegative real numbers fij such that 



(1) fij = whenever ([/). • 



0, 



■^In (r, c)-scaling, we are given an invertible real matrix, and the goal is to rescale all rows and columns to sum to 
The generalized version is to rescale the rows and columns to given values (not necessarily 1). 
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(2) fii-\ \- fiN = {p)ii for all i, and 

(3) hj + --- + fNj = {UpU~').. for all j. 
Given any such array, define a progress measure 



where we adopt the convention OP = 1. We claim that Z^^'^^^ > Z^^^ for all t > 1. To sec this, 
assume without loss of generality that we are on an odd step 2t + 1, and let cf *^ = {U^'^*^).. 
be the i*'* column sum before we normalize it. Then 




■+fiN^ 



As a result of the 2t*'* normalization step, we had Yli^i^^^ — 1- Subject to that constraint, the 
maximum of 

n(of")'"' 

i 

over theCf)'s occurs when C^^^ = for all i — a simple calculus fact that follows from the non- 
negativity of Kullback-Leibler distance. This implies that Z^'^*'^^^ > Z^'^*^ . Similarly, normalizing 
rows leads to zP*+2) > 

It follows that the limit = lim^^oo U^^'^ exists. For suppose not; then some cf^ is bounded 
away from so there exists an e > such that > (1 + £) 2^^*-* for all even t. But this is 

a contradiction, since > and < 1 for all t. ■ 

It is immediate that ST satisfies symmetry and indifference. Let us show that it satisfies 
product commutativity as well. 

Proposition 6 ST satisfies product commutativity. 

Proof. Given a state = H^a) ® H^b), let Ua 'Si I act only on \jPa) and let / Ub act only 
on IV'b)- Then we claim that 

S{\^I;),Ua(^I) = S{\^I;a),Ua)SI. 

The reason is simply that multiplying all amplitudes in \iPa) and Ua I'iPa) by a constant factor 
ax, as we do for each basis state \x) of \^b), has no effect on the scaling procedure that produces 
S{\^Pa),Ua). Similarly 

S{\i^),lS>UB) = lS>S{\i;B),UB). 
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It follows that 



S H^a) ,Ua)(S)S (IVb) ,Ub) = S {Ua \^PA) ® \^Pb) ,I(S)Ub)S ,Ua(^I) 

= Si\^A)^UB\^B),UA0l)S{\i;),I^UB). 



On the other hand, let us show that ST violates decomposition invariance. Using the same 



notation as in Section 14.11 we have RT^/glR^j^^ = I, R^^ 
1933^/4), from which it can be calculated that 



S (I, Rt,/s) 



Hence 



= I, 




8 |<y^7r/8 


' 


707 


0.293 





293 


0.707 


" 


555 


0.177 





445 


0.823 


" 


177 


0.555 





823 


0.445 



|(^^/4>, and R^/8 |(^5^ 



/8/ 



S (I, R^i 



5 Discussion 



The idea that certain observables in quantum mechanics might have trajectories governed by dy- 
namical laws has reappeared many times: in Schrodinger's 1931 stochastic approach j22j . Bohmian 
mechanics j3], modal interpretations [21IH1E1) and elsewhere. Yet because all of these proposals 
yield the same predictions for single-time probabilities, if we are to decide between them it must 
be on the basis of internal mathematical considerations. A main message of this paper has been 
that such considerations can actually get us quite far. 

To focus attention on the core issues, we restricted attention to the simplest possible setting: 
discrete time, a finite-dimensional Hilbert space, and a single orthogonal basis. Within this setting, 
we proposed what seem like reasonable axioms that any hidden-variable theory should satisfy: for 
example, symmetry under permutation of basis states, robustness to small perturbations, and 
independence of the temporal order of spacelike-separated events. We then showed that not all 
of these axioms can be satisfied simultaneously. But perhaps more surprisingly, we also showed 
that certain subsets of axioms can be satisfied for highly nontrivial reasons. In showing that the 
indifference and robustness axioms can be simultaneously satisfied, Sectionl^revealed an unexpected 
connection between unitary matrices and the classical theory of network flows. 

As mentioned previously, the most important open problem is to show that the Schrodinger 
theory satisfies robustness. Currently, we can only show that the matrix Pst (p, U) is robust to 
exponentially small perturbations, not polynomially small ones. The problem is that if any row or 
column sum in the f7(*) matrix is extremely small, then the (r, c)-scaling process will magnify tiny 
errors in the entries. Intuitively, though, this effect should be washed out by later scaling steps. 

A second open problem is whether there exists a theory that satisfies indifference, as well as 
commutativity for all separable mixed states (not just separable pure states). A third problem is 
to investigate other notions of robustness — for example, robustness to small multiplicative rather 
than additive errors. 
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